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REMARKS 

The Notice of Non-Compliant Amendment states that the claims filed on April 16, 2004 
do not comply with the requirements of 37 C.F.R. § 1.121(c) because claims 28 and 29 are listed 
as "previously presented" but in fact are "currently amended." In response, Applicants are 
submitting an Amendment which is substantially similar to the Amendment filed on 
April 16, 2004, but which has been amended to correct this inadvertent error. 

After entry of this Amendment, claims 14, 16-20, 23-29, 31-32 and 35-37 will be 
pending and under consideration in this application. 

Claims 14, 16, 19-20, 23-29 and 31-32 have been amended. Claims 35-41 have been 
added. Support for the claim amendments and for the new claims can be found throughout the 
specification, (The specific support for the claim amendments and new claims will be discussed 
below.) Applicants respectfully submit that none of the claim amendments add new matter to 
the application. 

Claims 1, 3-4, 7-9 and 12-13 have been cancelled without prejudice. Applicants reserve 
the right to file an application which claims priority to the instant application and contains the 
subject matter of the cancelled claims. 

Indefiniteness Rejections under 35 U.S.C. SI 12, %2 

The Examiner has rejected claims 1, 3, 4, 7-9, 12-14, 16-20, 23-29, 31 and 32 under 35 
U.S.C. §1 12, 1J2, as being indefinite due to their recitation of M64347_at. The Examiner states 
that M64347_at is a GenBank Accession No. and therefore an object which is variable. 

Claims 1, 3-4, 7-9 12 and 13 have been cancelled and, therefore, the rejection with 
respect to these claims has been obviated. 

Applicants respectfully traverse the rejection with respect to claims 14, 16-20, 23-29, 3 1 
and 32, which have been amended to refer to an informative gene comprising certain nucleotides 
of GenBank Accession No. M64347. Support for this amendment can be found at in the 
specification which discloses that the Affymetrix "HuGeneFL array" was used, and that 
M64347_at marker was upregulated. See Amendments to the Specification submitted in July 30, 
2003, under the heading "Microarray hybridization" and Table 1 . As indicated in the Affymetrix 
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website ( Exhibit A ), M64347_at refers to a set of probes which detect nucleotides 3336-3720 of 
GenBank Accession No. M64347. 

Applicants submit that reference to a GenBank Accession No. does not render the claims 
indefinite. A person of ordinary skill in the art would be able to determine the nucleotide 
sequence of a gene by reference to its GenBank Accession No. Further, while a GenBank 
Accession No. can be revised, a person of ordinary skill in the art would be to identify any 
revisions made to a sequence over time. Attached as Exhibit B is print-out from the National 
Center for Biotechnology Information ("NCBI") website which summarizes their policy with 
respect to sequences revisions (see page 1 of the Exhibit), and states that a person reviewing the 
records for a particular GenBank Accession No. would be able to determine whether a particular 
sequence has been revised and would be able to access previous versions of the sequence (see 
page 3 of the Exhibit). 

Furthermore, a search of the United States Patent and Trademarks Office ("USPTO") 
granted patents database reveals that the USPTO has granted patents containing claims which 
recite sequences by references to their GenBank Accession Nos. 1 Thus, Applicants respectfully 
request that the Examiner withdraw this rejection. 

The Examiner has also rejected claims 12-13, 24-25, 28 and 29 for referring to Table 1 
and Tables 2-6. Further, the Examiner states that the reference to the genes in Tables 1-6 by 
reference to their GenBank Accession Nos. renders the claims indefinite. 

Claims 12 and 13 have been cancelled and, therefore, the rejection with respect to these 
claims has been obviated. 

Claims 24-25, 28 and 29 have been amended to replace the reference to the Tables with 
reference to the GenBank Accession Nos. for the genes disclosed in the Table. Support for this 
amendment can be found at Tables 1 and 6. Table 1 lists the genes by reference to the probes 
used to detect the genes disclosed in Table 1, which correlate to the GenBank Accession Nos. for 
the genes disclosed in the Table 1 . Exhibit C provides information obtained from the 
Affymetrix website which shows such correlation. Note that Exhibit C does not provide the 
information for all of the probes disclosed in Table 1 since the purpose of the exhibit is to 



1 The query used was: "GenBank Accession Number" or "GenBank Accession No." in claims. This search resulted 
in 18 hits. Among the relevant hits were: U.S. Patent Nos. 6,667,065, 6,627,193, 6,468,773, and others. 
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demonstrate that the reference to the probes in Table 1 correlates with the GenBank Accession 
Nos. of the genes disclosed in Table 1. 

Applicants traverse the Examiner's statement that references to GenBank Accession Nos. 
render the claims indefinite for the reasons discussed above. Accordingly, Applicants 
respectfully submit that the scope of claims 24-25, 28 and 29 is definite, and request the 
Examiner to withdraw this rejection. 

Applicants note that claims 40 and 41 have been added. Support for the claims can be 
found throughout the specification, particularly at Tables 1 and 6. Applicants respectfully 
submit that claims 40 and 41 are definite. 

The Examiner has rejected claim 1 as indefinite "for failing to [show the] how the 
expression profile is correlated with a specific brain tumor type." Further, the Examiner states 
that "the metes and bounds of two or more informative genes beyond the M64347" is unclear. 
Claim 1 has been cancelled and, therefore, this rejection has been obviated. 

The Examiner has rejected claims 8, 9, 19 and 20 as indefinite because they "lack active 
method steps, as the recitation of 'utilizing' does not constitute a specific method step." 

Claims 8 and 9 have been cancelled and, therefore, the rejection with respect to these 
claims has been obviated. Applicants respectfully traverse this rejection with respect to claims 
19 and 20. Applicants submit that the recitation of "utilizing" in claims 19 and 20 is not 
intended to be a further method step. Instead the recitation of utilizing provides a further 
definition of how to determine a gene expression profile. Applicants have amended the claims 
to improve their form and clarify this point. Accordingly, Applicants respectfully request that 
the Examiner withdraw this rejection. 

The Examiner has rejected claim 14 as indefinite because the use of the term "the 
sample" lacks antecedent basis. Claim 14 as amended refers to "the brain tumor" rather than 
"the sample." This amendment renders the rejection moot. 

The Examiner has rejected claim 23 as indefinite for its recitation of "survival after 
treatment" as the predicted treatment outcome. Applicants have amended claim 23 to require the 
predicted treatment outcome to be a good prognosis of survival after treatment or treatment 
failure. Support for amended claim 23 can be found at page 9, lines 23-26 of the Substitute 
Specification. This amendment renders the rejection moot. 
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The Examiner has rejected claims 26 and 27 as indefinite because of their recitation of: 
"informative genes", "magnitude", "class distinction", "winning", and "summing the votes", 
"the sample to be tested", "first class" and "second class". {See Office Action, pages 3-4.) More 
specifically, with respect to claim 26, the Examiner states that "it is unclear how the 'magnitude' 
of the vote is to be determined because 'depending on the expression level of the gene' does not 
accurately define the mathematical relationship between the gene expression and the magnitude 
of the vote" (Office Action, page 3). 

In response to the Examiner's rejections of claims 26 and 27, Applicants traverse in part 
and amend in part. Applicants have amended these claims to improve their form and to more 
clearly define the claimed invention. Applicants respectfully submit that the amended claims 
would be considered definite by a person having ordinary skill in the art. 

The claims as amended require calculating the weighted vote of each informative gene. 
According to the specification, "informative genes" refers to "genes whose expression correlates 
with a particular phenotype " {See Substitute Specification, page 8, lines 23-25 and page 9, lines 
13-16.) Thus, with respect to claims 26 and 27, an informative gene is one which correlates with 
treatment outcome. 

Further, the claims have been amended to clarify "class distinction", "first class" and 
"second class". Applicants respectfully submit that based on the information disclosed in the 
specification and the knowledge in the art, a person of skill in the art at the time the application 
was filed would be able to calculate the weighted vote for an informative gene, and to sum up 
the votes to determine a winning class as required by claims 21 and 22. The weighted voting 
algorithm was well known in the art at the time the application was filed as evidenced by the fact 
that the specification cites to three references which use this method. See Substitute 
Specification, page 32, lines 27-28 and Amendments to the Specification submitted in July 30, 
2003, under the heading "Weighted Voting", citing to: U.S. Application No. 09/544,627 (now 
issued as U.S. Patent No. 6,647,341), Golub 1999, and Slonim 2000. 

Finally, the Examiner states that claims 26 and 27 are vague and indefinite because it is 
unclear if the level of gene expression used in the computation is a normalized or non- 
normalized level. Applicants respectfully traverse. Applicants submit that it is irrelevant to the 
claimed methods, and that a person of skill in the art would know whether the gene expression 
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level should be normalized or non-normalized in a particular instance. Accordingly, Applicants 
respectfully request that the Examiner withdraw this rejection. 

Enablement Rejections under 35 U.S.C. $112, f 1 

The Examiner has rejected claims 1, 3, 4, 7-9, 12-14, 16-20 and 23-29 for lack of 
enablement. 

Applicants have cancelled claims 1, 3, 4, 7-9, 12 and 13, directed to methods of 
classifying a brain tumor. Therefore, the Examiner's rejections with respect to these claims have 
been obviated. 

Applicants traverse the Examiner's enablement rejection with respect to claims 14, 16-20 
and 23-29, directed to methods of predicting the efficacy of treating a brain tumor, methods for 
predicting a treatment outcome of a patient with a brain tumor, methods for evaluating drug 
candidates for their effectiveness in treating a brain tumor or methods for monitoring the 
efficacy of a brain tumor. 

According to the Examiner, "[t]here are no teachings in the specification to correlate a 
value which is several standard deviations from the mean with a method of predicting the 
efficacy of a brain tumor. More specifically, the Examiner states that: (1) the specification does 
not teach whether the expression of M64347 as shown in Figure 3C was obtained from the brain 
tumor before treatment or after treatment, (2) it is unclear if the lowered expression of M64347 
is indicative or predictive of treatment failure or a treatment success as the title of Figure 3C is 
"Markers of Treatment Failure" but the heading of Table 1 is "Markers Downregulated with 
Low Risk" and (3) the specification does not define how the CI or CO groups were differentiated 
and does not teach what constitutes a treatment failure or success in terms of disease free 
survival or length of survival. 

As mentioned above, Applicants traverse. First, the specification teaches that expression 
of M64347 as shown in Figure 3C was obtained from the brain tumor before treatment. See 
Substitute Specification, page 40, line 28 to page 41, line 2. 

Second, the specification (Table 1, Table 6 and Figure 3C) shows that the upregulation of 
M64347 is correlated with a "high risk class" of individuals (e.g., a class of individuals with 
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poor prognosis for survival after treatment). See Substitute Specification, page 9, lines 13-16 
stating that "a sample can be classified as belonging to a high risk class (e.g., a class with poor 
prognosis for survival after treatment) or a low risk class (e.g., a class with good prognosis for 
survival after treatment)." Thus, the heading of Table 1 - "Markers Upregulated in High Risk, 
Downregulated in low Risk" - is not inconsistent with the heading of Figure 3C - "Markers of 
Treatment Failure" - as it appears to be suggested by the Examiner. 

Third, the specification describes that in Figure 3 CO and CI correspond to two 
unsupervised SOM-derived clusters, and that Class CI tumors are notable for their high 
ribosomal content. See Substitute Specification, page 7, lines 2-4. The specification further 
states that the CO and CI groups were not correlated with patient survival. See Substitute 
Specification, page 41, lines 13-17. 

Finally, the specification teaches what constitutes treatment failure or success in terms of 
disease free survival or length of survival. The specification states that they differentiated 
"patients who are alive following treatment ('survivors') compared to those [patients] who 
succumbed to their disease ('failures'; minimum follow-up 24 months for surviving patients; 
overall median 41.5 months)." See Substitute Specification page 41, lines 17-21. 

The Examiner also states that "[t]here is no guidance for a specific polynucleotide probe 
and hybridization conditions to be used in the determination of an expression profile for . . . the 
method of predicting the efficacy of treatment." The Examiner noted there are different 
isoforms of FGFR3, the gene encoded by M64347, and stated that a probe to this gene could 
hybridize to any number of the polymorphic gene products or alleles. The Examiner concluded 
that the "specification provides no teachings as to the exact nature of the probe used for the 
expression profile, thus it cannot be construed from the specification which polymorphic 
variants, splice variants or alleles are integral to the claimed invention." 

Applicants note that not all of the claims recite the use of a probe and/or require the use 
of hybridization condition to determine an expression profile. In any case, Applicants 
respectfully traverse the Examiner's rejection to the extent that certain claims require the use of 
a specific polynucleotide probe and hybridization conditions. 

Contrary to the Examiner's assertion, the specification teaches the exact nature of the 
probes used to determine the expression profiles for the classification of a brain tumor, or the 
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method of predicting the efficacy of treatment. The specification states that they used 
Affymetrix's HuGeneFL array. {See Amendments to the Specification submitted in July 30, 
2003, under the heading "Microarray hybridization.") Based on this disclosure, a person of skill 
in the art would have been able to identify the probes present in the array and used in the 
specification to determine the expression profiles for the method of predicting the efficacy of 
treating a brain tumor. 

More specifically, based on the disclosure, a person skilled in the art would have been 
able to determine the specific probe used to determine the expression profile of M64347, and the 
other informative genes disclosed in the specification. The claims have been amended to recite 
an informative gene comprising nucleotides 3336-3720 of GenBank Accession No. M64347, 
which are the nucleotide sequences in Affymetrix's HuGeneFL array. See Exhibit A (obtained 
from Affymetrix's website). Thus, Applicants submit that the claims, as amended, are enabled 
with respect to the probes which can be used to practice the methods of the claimed invention. 

Applicants respectfully submit that, in view of the specification, which teaches that the 
expression profile of an informative gene which hybridizes to nucleotides 3336 to 3720 of 
GenBank Accession No. M64347 correlates with efficacy of treating a brain tumor, the fact that 
there are different allelic variants or isoforms of the gene encoded by GenBank Accession No. 
M64347 is irrelevant. 

Applicants respectfully submit that a person of ordinary skill in the art would know what 
hybridization conditions should be employed to determine the expression profile of the 
informative genes of the claims. Moreover, the specification teaches the hybridization 
conditions used in the experiments disclosed in the specification. {See Amendments to the 
Specification submitted in July 30, 2003, under the heading "Microarray hybridization.") Thus, 
Applicants submit that the claims are enabled with respect to the hybridization conditions useful 
in practicing the methods of the claimed invention. 

With respect to claims 26-29, the Examiner states that "the specification does not define 
the parameters needed to calculated weighted vote for M64347." Applicants respectfully 
traverse. As discussed above, based on the information disclosed in the specification, a person 
of skill in the art would know how to determine weighted vote as recited in the claims without 
undue experimentation as evidenced by the fact that the specification refers to a patent 
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application (U.S. Application No. 09/544,627, now issued as U.S. Patent No. 6,647,341) and two 
papers (Golub 1999 and Slonim 2000) that disclose the use of this weighted voting algorithm 
before the instant application was filed. (See Substitute Specification, page 32, lines 27-28 and 
Amendments to the Specification submitted in July 30, 2003, under the heading "Weighted 
Voting.") 

The Examiner states that "Applicants arguments regarding the teachings of Golub et al. 
for methods of determining class and subclass as set forth in U.S. application No. 09/544,627 are 
unpersuasive" because the instant application could issue before the referenced application. (See 
Office Action, page 7.) Applicants note that the referenced application has now issued as U.S. 
Patent No. 6,647,341 . Applicants have amended the application accordingly. 

In view of the arguments presented above, Applicants respectfully request that the 
Examiner withdraw the enablement rejections. 

Obviousness Rejections 

The Examiner has rejected claims 31 and 32 under 35 U.S.C. § 103(a) as being 
unpatentable over U.S. Patent No. 6,500,938 ("Au-Young") in view of Abbass et al., 1997, J. 
Clin. Endocrinol Metab., 82:1 160-1 166) ("Abbass"), or over U.S. Patent No. 6,218,122 
("Friend") in view of Abbass. 

The Examiner alleges that Au-Young teaches methods of monitoring the progression of a 
disease or the efficacy of a treatment comprising detecting an expression profile by means of a 
micro array. The Examiner alleges that Friend teaches methods for detecting changes in a 
biological state of a subject which are correlated to one or more disease states and methods for 
monitoring the efficacies of a therapy comprising the determination of an expression profile 
from said cells in a patient. However, as admitted by the Examiner, neither Au-Young nor 
Friend teaches the expression profile of M64347 or the FGFR3 encoded thereby. 

The Examiner alleges that Abbass teaches "that the expression of the mRNA encoding 
the secreted form of FGFR3, which would be expressed from the M64347_at gene, is correlated 
with pituitary adenomas." Further, as admitted by the Examiner, Abbass does not teach a 
correlation between the expression profile of FGFR3 and tumor type, size or aggressiveness. 
(See 12/31/03 Office Action, page 5.) 
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It is respectfully pointed out to the Examiner that a proper rejection based on 35 U.S.C. 
§103 that relies on a combination of prior art references requires a teaching, suggestion, or 
motivation to combine the teachings of the references; a reasonable expectation of success 
founded in the cited art of producing the claimed invention; and that such proper combination 
teaches or suggests all elements of the claimed invention. Applicants respectfully traverse the 
obviousness rejections for failing to meet all of these requirements for the reasons provided 
below. 

Claims 31 and 32, as amended, recite methods for evaluating drug candidates for their 
effectiveness in treating brain tumors or methods for monitoring the efficacy of a brain tumor 
treatment, wherein the brain tumor is selected form the group consisting of mellanoblastomas, 
glioblastomas, rhabdoid tumors, primitive neuroectodermal tumors, and pineoblastomas. 
Support for this amendment, and for newly added claims 35-39, can be found throughout the 
specification. See, e.g,, Substitute Specification, page 3, lines 1-2. 

Applicants respectfully submit that there is no motivation to combine the references cited 
by the Examiner to reach the invention of amended claims 31 and 32. The Examiner states that 
"one of ordinary skill in the art would have been motivated to [combine the references] with a 
reasonable expectation of success by the teachings of Abbass et al. on the unique expression of 
the secretable form of FGFR3 mRNA in pituitary adenomas versus the lack of expression of the 
secretable form of this receptor in normal pituitary." However, none of the references teach the 
correlation between the gene expression profile of M64347 and effectiveness in treating a brain 
tumor selected from the group consisting of mellanoblastomas, glioblastomas, rhabdoid tumors, 
primitive neuroectodermal tumors, and pineoblastomas, or monitoring the efficacy of a treatment 
for any of the mentioned brain tumors. Therefore, there would be no motivation to combine the 
references as argued by the Examiner. 

Further, even if the references were to be combined as suggested by the Examiner, the 
combination of references would not teach or suggest the inventions of claims 3 1 and 32. 
Rather, the combination of the references would at best teach the use of M64347 to evaluate 
drug candidates for their effectiveness in treating a pituitary adenoma, or to monitor the efficacy 
of a pituitary adenoma treatment. Nothing in the cited references suggests or discloses the use of 
M64347 to evaluate drug or monitor the efficacy of a drug to treat the brain tumors of the 
claims. Moreover, even if the references were combined, there would be no reasonable 
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expectation of success. Accordingly, Applicants respectfully request that the Examiner 
withdraw the obviousness rejections. 

Conclusion 

In view of the above, each of the presently pending claims in this application is believed 
to be in immediate condition for allowance. Accordingly, the Examiner is respectfully requested 
to pass this application to issue. 

Applicant believes no fee is due with this response. However, if a fee is due, please 
charge our Deposit Account No. 18-1945, under Order No. WIBL-P01-561 from which the 
undersigned is authorized to draw. 

Dated: July 15, 2004 Respectfully submitted, 

Gloria Fuentes 

Registration No.: 47,580 
ROPES & GRAY LLP 
45 Rockefeller Plaza 
New York, New York 1011 1-0087 
(212) 497-3624 
(212) 497-3650 (Fax) 
Attorneys/ Agents For Applicant 
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isoform 1 precursor; hydroxyaryl- 
protein kinase; tyrosine kinase JTK4 
[Homo sapiens] 



0.0 



0.0 



0.0 



20452380 



0.0 



Method :. ID 
Hanks FGFR-3 



Protein 
Families 



ec 



Hanks FGFR-3 



ec 



Description 

FGR3_HUMAN (FGFR-3) KINASES:5:6' 3 | 
PTK Group B membrane spanning protein 
tyrosine kinases.PTK XV Fibroblast growth 
factor receptor family .FGFR-3 

ZA70 HUMAN ZA70 HUMAN EC:2.7. 1.11 2:TYROSINE- 

PROTEIN KINASE ZAP-70 (EC 2.7.1 .112) (70 
KDA ZETA-ASSOCIATED PROTEIN) (SYK- 
■ RELATED TYROSINE KiNASE). 
FGR3_HUMAN (FGFR-3) KINASES:5.6.3 | . 
PTK Group B membrane spanning protein 
tyrosine kinases.PTK XV Fibroblast growth 
factor receptor family .FGFR-3 

ZA70 HUMAN ZA70 HUMAN EC:2.7.1 .11 2:TYROSlNE- 

PROTEIN KINASE ZAP-70 (EC 2.7.1 .112) (70 
KDA ZETA-ASSOCIATED PROTEIN) (SYK- 
RELATED TYROSINE KINASE). 



E-Value 

1.0E- 
166 



7.38E- 
99 



1.0E- 
167 



7.38E- 
99 



Database ID 
scop dlqjoa 



scop 
scop 



d1ev2e1 
dlgjoa_ 



Description E-Value 

d1gjoa^SCOP:d.144.i ;2:| Fibroblast growth 3.81E- 

factor receptor 2 : 81 . ; 

d1ev2e1 SCOP;b. 1.1. 4:| Fibroblast growth 4.95E- 
factor receptor, FGFR . : . 21 

d1gjoaj.SCOP:d.144.1.2:| Fibroblast growth 3.81E- 

factor receptor 2 81 
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Protein 
Domains 



scop 


d1ev2e1 


dlev2e1 SCOP:b.T.1.4:| Fibroblast growth 


4.25E- 




fartnr rpeiebtor FGFR 


21 : 


JJIdlll 


in • 


Immnnnnlnhulin domain - ' 


1.6E-5 


pidfTl 


in 


immi innnlnhi ilin dnmain 
ii i ii (iui luy luuuiii i uuinciii i 




pial 1 1 




Prntpin kinase domain 


2.3E-92 


pfam 


ia 


Immunoglobulin domain 


1 .Ofc-0 


■pfam 


ia 


Immunoglobulin domain 


3.2E-8 


pfam 


okinase 


Protein kinase domain . 


2.3E-92 


pfam 


ia 


Immunoglobulin domain 


7.3E-8 


InterPro 


IPR000719 


Protein kinase 






EMBL-EBI 






InterPro 


IPR007110 Immunoglobulin-like 






EMBL-EBI 






InterPro 


; IPR001245 Tyrosine protein kinase . 






EMBL-EBI 






InterPro 


JPR008266 Tyrosine protein kinase, activd site 






EMBL-EBI 






InterPro 


IPR003598 Immunoglobulin C-2 type 






EMBL-EBI 







Target 
Sequence 



Trans Membrane 

■ ; ID 
NP 000133 



Number Of 
Domains 



Probability of Interior N -Terminus 
0.11005 



■ Sequence . 

>HUGENEFL : M6 434 7_AT 

gacttcaaagcaagctggtattttcatacaaattcttctaattgctgtgtgtcccaggca 
gggagacOTtttccagggaggggccggccctgtgtgcaggttccgatgttattagatgtt 
acaagtttatatatatctatatatataatttattgagtttttacaagatgtatttgttgt 
agacttaacacttcttacgcaatgcttctagagttttatagcctggactgctacctttca 
aagcttggagggaagGcgtgaattcagttggttcgttctgtactgttactgggccctgag 
tctgggcagctgtcccttgcttgcctgcagggccatggctcagggtggtctcttcttggg. 
gcdcagtgcatggtggccagaggtgtcacccaaaccggcaggtgcgatt . 



Probe Info 



Probe Sequence(5'-3') 


Probe 
X 


Probe 
Y 


Probe 
Interrogation 


Strandedness 






Position 




GACTTCAAAGCAAGCTGGTAI I I I G 


359 


161 


3348 


Antisense 


catacaaattcttctaattgctgtg 


360 


161 


3372 


Antisense 


AATTCTTCTAATTGCTGTGTGTCCC 


361 


161 


3378 


Antisense 


TGCTGTGTGTGCCAGGCAGGGAGAC 


362 


161 


3390 


Antisense 


tGTGTGCAGGTTCCGATGTTATTAG 


363 


161 


3438 


Antisense 


TCTTACGCAATGCTTCTAGAGTTTT 


364 


161 


3540 


Antisense 


GCAATGCTTGTAGAGTTTTATAGCC 


365 


161 


:' 3546 


Antisense 


GAG I I I I ATAGCCTGGACTGCTACC 


366 


161 


3558 


Antisense 


TGCTACCTTTCAAAGGTTGGAGGGA 


367 


161 


3576 


Antisense 


AAGCTTGGAGGGAAGCCGTGAATTC 


368 


161 


3588 


Antisense 


TGAATTCAGTTGGTTCGTTCTGTAC 


369 


161 


3606 


Antisense 


GTTCGTTCTGTACTGTTACTGGGCC 


370 




, 3618 


Antisense 


CTGGGCCCTGAGTCTGGGCAGCTGT 


371 


161 


3636 


Antisense 


CCTGAGTCTGGGCAGCTGTCCCTTG 


372 


16t 


3642 


.Antisense 


TCTGGGCAGCTGTCCCTTGCTTGGC 


373 


161. 


3648 : 


Antisense 


TCCCTTGCTTGCCTGCAGGGCCATG 


374 


161 


3660 


Antisense 
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GCTTGCCTGCAGGGGCATGGCTCAG, 375 161 : 3666 Antisense 

CTTGGGGCGCAGTGCATGGTGGCCA 376 161 3702 Antisense 

GTGGCCAGAGGTGTCACCCAAACCG 377 161 3720 Antisense 

GTCACCCAAACCGGCAGGTGCGATT 378 161 ■•. ■ 3732 . Antisense 
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PubMed 



Sequence Revision History 

Entrez BLAST . OMIM 




Taxonomy 



SITE 
MAP 

Structure 



NCBI Home 

Site Map 

brief and complete 

versions 

About NCBI 

general and contact 
information 

GenBank 
submit your 
seiquence, general 
information 

Molecular 
Databases 

nucleotides, 
proteins, structures 
and taxonomy 

Literature 
Databases 
PubMed, PubRef, 
OMIM, Citation 
Matcher 

Genomes and 
Maps 

maps, the human 
genome and model 
organisms . 

Tools 

for data mining and 
analysis 

Research at 
NCBI 

people and projects 

Software 
Engineering 
Tools, R&D and 
databases 

Education 

teaching resources 
and on-line tutorials. 



The Sequence Revision History tool allows you to see the 
various gi numbers, version numbers, arid update dates for 
sequences that appeared in a specific GenBank record. 

E.g., search for U46667 in the tool to see the old and current 
identifiers of the nucleotide sequence in that record. 

Note that the original gi number for the nucleotide sequence, 
2734632, does not have a corresponding versjon number. This 
is true because it was removed from the database (and replaced 
by 31 72140) before the new accession iverioh system was 
implemented in Feb. 1 999. At that time, each sequence in the 
GenBank/EMBL/DDBJ database received a version number of 1, 
even if they had been updated in the past. 

In addition, if a GenBank record contains an updated sequence, 
the Comment field will contain a cross-reference to the gi 
number of the earlier sequence. (E.g., see U46667 in Entrez.) If 
you follow the link for that earlier gi number, Entrez will display 
that version of the GenBank record. Similarly, the Comment field 
of the older version will have a warning that the sequence has 
been updated, and will contain a cross-reference to the newer 
version. 



More details about seq uence identification numbers (GI and 
accession.version). 

Back to sample record . ; 



FTP site 
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download data and 
software 



Help Desk NCBI NLM N!H Credits 

Revised October 12003 

Questions nhnut NCBI resources to info@ncbi jiim.nih.gov 
cement* about site map to Renata Geer renata@ncb i.nlm ; nih.gov 



httn 7/wwwnr.hi n1m. nib: gov/Sitema.o/seduencerevisionhistorv;html 
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Page 1 of 1 



PubMed 



Nucleotide 



Protein 



Genome 



Sequence Revision History 

Taxonomy 



. Structure 



PMC 



. OMIM 



Find (Accessions, GI numbers or Fasta style Seqlds^ttG^ 



About Entrez 



Entrez 



Search for Genes . 

LocusLink provides cura ted 
information for human; fruit 
fly, mouse, rat, and 
zebrafish , 



Help] FAQ 



Batch Entrez: Upload a 
file of. GI or accession . 
numbers to retrieve 
protein or nucleotide 
sequences 

Check sequence 
revision history 

How to create WWW 
links to Entrez 

LinkOut 

Cubby 

Related resources 

blast" 

Reference sequence project 
LocusLink 

Clusters of orthologous groups 



I difference between I and II as jGenBank/GenPept 



Revision history for U46667 



GI 


Version 


Update Date 


Status 


1 


II 


3172140 


'■' 1 


Aug 7 1998 9:28 AM 


:. Live 


■•® 




3172140 


.1 ' 


Jun 2 1998 4:31 PM 


Dead 


"o 




2734632 


rt/a 


Jan 3 199812:12AM 


Dead 


■ O: 


0 


2734632 


n/a 


Jan 1 1998 12:30 AM 


Dead 


P ' 


■C-, 


Accession U4 


6667 was first seen at NCBI on Jan 1 1998 12:3U AM 



Protein reviews on the web 



Disclaimer | Write to the Help Desk . 
7 NCBI ) NLM | NIH 



V»t+rWAxn*m/ np.hi nlm nih. 

crnv/entre7,/suti1s/eirevhi^t.cei?val=U46667 
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home | logout | register | your profile | contact I site index 



search site 



AFFYMETRIX* 



products 



ANALYSIS i 



« GETTING 
STARTED 

-> Wizard . 

:: QUERY 

Expression 
-> Quick Query 
-> Standard Query 
-> Batch Query 
-> BLAST 

. -> Probe Match 
-> UCSC Query 

Genotyping 
-> Quick Query 
-> Standard Query 
-> Batch Query 
-> UCSC Query 
->.SNP Finder 

:: CURRENT QUERY 
1 probe sets 

-> Annotations 
-> Show Orthoiogs 
/.-> GO Browser 
-> Export 

QUERY HISTORY 

Annotation Views 

Expression 
.-> Genotyping 

-> BLAST Status 

-> New Folder 
-> Expression 
Queries 

->(1)Ail Descriptions 

(L17131) 
->(0)AH Descriptions 

(L17131_rnaLat) 
->(0)A!l DescripTions 

(L17131 rnal at) 
->(1)AH Descriptions 

^m64347) 
. -> all probe sets 

(7129) 
-> Genotyping 
Queries 



the new GeneChip* One-Cycle and 
Two-Cyde cDNA Synthesis Kite, 




SUPPORT j ..TECHNOLOGY .!' -RESEARCH COM MUNSTY j CORPORATE 



Full Record 



-> START > 



Details for HUGENEFL:L17131_RNA1 - AT 
Full Screen 

\ t A - ^ , ■ , Cluster Members 
NetAffx Links cbnsensus/Exempiar 



GeneChip Array Information 



Probe Set ID 
GeneChip 
Array 

Organism 
Common 
, Name 



Transcript ID 

Sequence 
Type 
Representative 
Public ID 

Target 
Description 



L1713l_ma1_at 
HumanGeneFL Array 

Human 

Probe Design Information 

L17131_rna1 
Exemplar sequence 



Target 
Sequence 



L17131 NCBI 

;L1 7131 /class A, 20 probes, 20 in L17131mRNA#1 1646-2198, Human high 
mobility group protein (HMG-I(Y)) gene exons 1-8, complete cds 

• Sequence: 

>HUGENEFL : L 1 7 1 3 1_RNA1_AT ^ 

ttgtccaggtgaggcccaagagccctgtggccgccacctgaggtgggctggggctgctcc 
cctaaccctactttcgttccgccactcagccatttccccctcctcagatggggcaccaat 
aacaaggagctcaccctgcccgc'tcccaacccccctcctgctcctccctgccccccaagg 
ttctggttccatttttcctctgttcacaaactacctctggacagttgtgttgttttttgt; 
tcaatgttccattcttcgacatccgtcattgctgctgctaccagcgccaaatgttcatcc. 
tcattgcctcctgttctgcccacgatcccctcccccaagatactctttgtggggaagagg. 
ggctggggcatggcaggctgggtgaccgactaccccagtcccagggaaggtggggccctg 
cccctaggatgctgcagcagagtgagcaagggggcccgaatcgaccataaagggtgtagg 
ggccacctGCtccccctgttctgttggggaggggtagccatgatttgtcccagcctgggg 
ctccctctctggtttcctatttgcagttacttgaata . 



Probe Sequence(5'-3') 



Probe Probe 
X Y 



TTGTQCAGGTGAGGCCCAAGAGCCC 294 101 

AGGiTGAGGCCCAAGAGCCCTGTGGC 295 101 

■ - ACGAATAACAAGGAGCTCACGCTGC 296 101 

tTTTCCTCTGTTCACAAACTACCTC 297 101 

CTACCTCTGGACAGTTGTGTTGTTT 298 101 

TTCCATTCTTCGACATCCGTCATTG 299 101 

TCTTCGACATCCGTCATTGCTGCTG 300 101 



Probe 
Interrogation 

Position 

1658 

1664 

1772 

1850 

1868 

1904- \ 

1910 



Strandednes? 

Antisense 
Antisense 
Antisense 
Antisense 
Antisense 
Antisense 
Antisense 
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GCTACCAGCGCCAAAT GTTCATCCT 


301 


101 


.1934 


Antisense 


TCATCCTCATTGCCTCCTGTTCTGC 


302 


101 


. 1952 


Antisense 


TCATTGCCTCCTGTTCTGCCC ACGA 


303 


101 


1958 . . . 


Antisense ; 


aagatActctttgtggggaagaggg 


304 


101 


1994 


Antisense 


GCAGGCTGGGTGACCGACTACCCCA 


305 


101 


2030 


Antisense 


CCCCTAGGATGCTGCAGCAGAGTGA 


■ 306 


101 


2078 


Antisense 


agcaagggggcgcgaAtcgaccata 


307 


101 


2102 


Antisense 


CGAATCGACCATAAAGGGTGTAGGG 


308 


101 


2114 


Antisense 


GCCATGATTTGf CCCAGCCTGGGGC 


309 


101 ' 


2174 


Antisense 


CTGGGGCTCCCTCTCTGGT7TCCTA 


310 


101 


2192 


Antisense 


CTCCCTCTCTGGTTTCCTATTTGCA 


311 


101 


2198 


Antisense 


CTCTGGTTTCCTATITGCAGTTACT 


312 


-101 


2204 


Antisense 


tttcctatttgcAgttacttgaata 


31 3 , 


101 


2210 


Antisense 
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home | logout {register | yoiir profile | contact | site index 



search site 



AFFYMETRIX, 



PRODUCTS 




SUPPORT : TECHNOLOGY \ RESEARCH COMMUNITY j CORPORATE 



-> START ) 



" GETTING 
STARTED 

:-> Wizard • 

:: QUERY 
Expression 

~> Quick Query ; 
-> Standard Query 
-> Batch Query 
-> BLAST. 
-> Probe Match 
-> UCSC Query 
Genotyping 
-> Quick Query , 
-> Standard Query 
-> Batch Query . 
-> UCSC Query 
-> SNP Finder ; . 

CURRENT QUERY . 
1 probe sets 

. -> Annotations . 
-> Show Orthologs 
-> GO Browser 
-> Export 

« QUERY HISTORY 

Annotation Views 

-> Expression 
-> Genotyping 

-> BLAST Status 

-> New Folder 
-> Expression 
Queries 

»(1)Ail Descriptions 

(X74801) 
->(1)AII Descriptions 

(L17131) 
r ->(0)AII Descriptions 
(L17131jrnahat) 
->(0)AII Descriptions 
(L17131_rnaLat) 
. ->(1)AII Descriptions 

(m64347) 
-> Genotyping 
Queries 



Full Record 



Details for HUGENEFL:X74801_AT 
Full Screen 

'.==; t i , Cluster Members 
NetAffx Links Consensus/Exe m D |ar 

GeneChip Array Information 

Probe Set ID X74801_at 
G Ar G ray IP :HumanGeneFL Array , 



Organism 
Common 
Name 



Human 



Probe Design Information 



Transcript ID X74801 

& Type Ce Exem P ,ar sequence 

Re^sen^ve X74801 NCBr 
Public ID - ~ '■ " 



Target 
Description 



Assembly 
Alignment(s) 



Overlapping 
Transcripts 



X74801, class B, 20 probes, 12 in X74801cds 1282-1552: 8 in reverseSequence, 
1636-1837, H.sapiens Cctg mRNA for chaperonin 

Genomic Alignment of Target Sequence 

April 2003 (NCBI 33) 

Position % Identity Cytoband 



q22 



Gene Title 

Gene Symbol CCT3 HGNC 

Chromosomal 
Location 

UniGene ID 

Ensembl 

LocusLink 



■". chrl : 153495555-153497649 (-) UCSC 100 

Representative : UniGene Description 
Transcript. 

NMJD05998 chaperonin containing TCP1 , 
NCBI subunit 3 (gamma) 

Public Domain and Genome References 
chaperonin containing TCP1 , subunit 3 (gamma) 



Position 



chrl :1 53495551-1 53524840 
(-)UCSC 



SwissProt 



OM1M 



1q23 

Hs.1708 NCBi (FULL LENGTH) 

ENSG000001 63468 Ensernb! . 

7203 NCBI 

AAH06501 EMBL-EBI 
P49368 EMBL-EBI 

600114 NCBI 



RefSeq Protein Np _ 005989 NCB} 
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RefSeq 



Ortholog 



RefSeq Transcript ID RefSeq Title 

NM 005998 NCBI chaperonin containing TCP! , subunit 3. (gamma) 

Functional Annotations 

id 



ATH1 -121 501:246830 AT 
ATGENOME1:18906 AT 
DROSGENOME1:1 53982 AT 
MG-U74AV2:161238 F AT 
MG-U74AV2:98153 AT : 
MG-U74CV2:171548 AT 
MQE436A:1416024 X AT 
MOE430A:1426067 X AT 
MOE430A.1448178 A AT 
MOE430A:1 449645 S AT 
MOE430A:1 451 915 AT 
MOE430A:1459987 S AT 



Title Organism type 

chaperonin, putative Arabidopsis Putative 

Ortholog 

chaperonin, putative Arabidopsis Putative 

•". Orthblog 

Drosophila Putative 
r Orthblog 

chaperonin subunit 3 Mouse Curated. 
(gamma) , Ortholog 

chaperonin subunit 3 Mouse Curated 

(gamma) . Ortholog 

chaperonin subunit 3 Mouse Curated 

(gamma) v Ortholog 

chaperonin subunit 3 Mouse Curated 

(gamma) Ortholog 

chaperonin subunit 3 Mouse , Curated 

(gamma) Ortholog 

chaperonin subunit 3 Mouse? Curated 

(gamma) Ortholog 

chaperonin subunit 3 Mouse Curated 

(gamma) Ortholog 

chaperonin subunit 3 Mouse ; Curated 

Ortholog 

Curated 
Orthoipg 



(gamma) 

chaperonin subunit 3 Mouse 
(gamma) 

MU11KSUBA:C79428 RC F AT chaperonin subunit 3 Mouse Curated 

(gamma) Ortholog 

MU11KSUBA:L205Q9 F AT chaperonin subunit 3 Mouse Curated 

(gamma) Ortholog 

MOUSE430 2:1416024 X AT chaperonin subunit 3 Mouse Curated 

(gamma) Ortholog 

MOUSE430 2:1426067 X AT chaperonin subunit 3 Mouse : Curated 

(gamma) Ortholog 

MOUSE430 2:1448178 A AT chaperonin subunit 3 Mouse Curated 

(gamma) Ortholog 

MOUSE430 2:1449645 S AT chaperonin subunit 3 Mouse Curated 

(gamma) Ortholog 

MOUSE430 2:1451915 AT chaperonin subunit 3 Mouse Curated 

(gamma) Ortholog 

MOUSE430 2:1459987 S AT chaperonin subunit 3 Mouse Curated 

(gamma) Ortholog 

MOUSE430A 2:1416024 X AT chaperonin subunit 3 Mouse Curated 

(gamma) Ortholog 

MOUSE430A 2:1426067 X AT chaperonin subunit 3 Mouse Curated 

(gamma) Ortholog 

MOUSE430A 2:1448178 A AT chaperonin subunit 3 Mouse Curated 

(gamma): Ortholog 

MOUSE430A 2:1449645 S AT chaperonin subunit 3 Mouse Curated 

; (gamma) . Ortholog 

MOUSE430A 2:1451915 AT chaperonin subunit 3 Mouse Curated 

(gamma) . Ortholog 
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MOUSE430A 2:1459987 S AT chaperonin subunit 3 Mouse Curated 

v (gamma) . Ortholog 



GO Biological Process (view graph) * 

ID. Description ; Evidence 
6457 protein folding traceable author statement 
GO Cellular Component (view graph) 
ID Description Evidence 
Gene Ontology 5829 cytosol not recorded / 

5856 cytoskeieton traceable author statement, 
GO Molecular Function (view graph) 

ID Description Evidence 
3754 chaperone activity traceable author statement 



Links 
QuickGO AmiGO 

Links . 
QuickGO AmiGO 
QuickGO AmiGO 

Links 

QuickGO AmiGO 



Protein 
Similarities 



Protein 
Domains 



5524 ATP binding 



inferred from electronic annotation QuickGO AmiGO 



Description 



Method ID 

blast 33873532 
blast 31542292 chaperonin containing TCP1, subunit 3 (gammia); 0.0 

TCP1 (t-complex-1 ) ring complex, polypeptide 5 

[Homo sapiens] 



E-Value 

0.0 



Database 

scop 

scop 
[ scop 
pfam 
InterPro 
InterPro 
InterPro 
InterPro 



ID / V Description 
d1a6da3 d1a6da3 SCOP:d.56,1.2:|:Thermosome 

dlqmla digmla_SCOP:c.8.5.2:| Thermospme ' 

d1a6da1 d1a6da1 SCOP:a.129.1.2:| Thermosome 



cpn60 TCP1 TCP-1/cpn60 chaperonin family 

IPR002423 Chaperonin Cpn60/TCP-1 
EMBL-EBI ; 

IPR001844 Chaperonin Cpn60 

EMBL-EBI ■ ">.• . ; / 

IPR002194 Chaperonin TCP- 1 
EMBL-EBI . 

IPR008950 GroEL-like chaperone, ATPase 
EMBL-EBI 



E-Value 

4^08E- 
25 

1.01E- 

57 

4.81 E-v 
83 

5.7E- 
210 



Target 
Sequence 



Sequence 

>HUGENEFL:X74801_AT ; .' . : : , 
atgactggtgtggaacaatggccatacagggctgttgcccaggccctagaggtcatt 

. cgtaccctgatccagaactgtggggccagcaccatccgtctacttacctcccttcgggcc 
aagcacacccaggagaactgtgagacctggggtgtaaatggtgagacgggtactttggtg 
gacatgaagg^actgggcatatgggagccattggctgtgaagctgcagac.ttataagaca 
gcagtggagacggcagttctgctactgcgaattgatgacatcgtttcaggccacaaaaag 
aaaggcgatgaccagagccggcaaggcggggctcctgatgctggccaggagtgagtgcta 
ggcaaggctacttcaatgcacagaaccagcagagtctccccttttcctgagccagagtgc 

.caggaacactgtggacgtctttgttcagaagggatcaggttggggggcagcccccagtcc 
ctttctgtcccagctcagttttccaaaagacactgacatgtaattcttctctattgtaag. 
gtttccatttagtttgcttccgatgattaaatctaagtca . 



Probe Sequence^'-Ty 



Probe Probe 
X Y 



Probe 

Interrogation strandedhesa 
Position 



htas://ww^ 
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Probe Info 



ATGACTGGTGTGGAACAATGGCCAT 60 345 1294 Antisense 

GAACAATGGCCATACAGGGCTGTTG 61 345 1 306 Antisense 

CTGATCCAGAACTGTGGGGCCAGCA 62 345 1360 Antisense 

CAGAACTGTGGGGCCAGCACCATCC 63. 345 1366 Antisense 

TGTGGGGCCAGCACCATCGGTCTAC 64. 345 1372 Antisense 

CTGGGCATATGGGAGCCATTGGCTG 65 345 14€t6 ; Antisense 

ATATGGGAGCCATTGGCtGTGAAGC 66 345 1492 Antisense 

GAGCCATTGGCTGTGAAGCTGCAGA 67 . 345 : 1498 . Antisense 

TTGGCTGTGAAGCTGCAGACTTATA 68 345 1504 : Antisense 

GAGACGGCAGTTCTGCTACTGCGAA 69 345 1540 Antisense 

GCAGTTCTGCTACTGCGAATTGATG 70 345 1 546 Antisense 

ATTGATGACATCGTTTCAGGCCACA 71. 345 1564 Antisense 

GTGCTAGGCAAGGCTACTtCAATGC 72 345 1648 Antisense 

GGCAAGGCTACTTCAATGCACAGAA 73 345 1654 Antisense 

GCTAGTTCAATGCACAGAAGCAGCA 74 345 1660 Antisense 

CACAGAACCAGCAGAGTCTCCCCTT 75 345 1672 Antisense 

GAGCCAGAGTGCCAGGAACACTGTG 76 345 1702 Antisense 

CACTGACATGTAATTCTTCTCTATT 77 345 1 804 Antisense 

TAGTTTGCTTCCGATGATTAAATCT 78 345 1843 Antisense; 

GCTTCCGATGATTAAATCTAAGTCA 79 345 1849, Antisense 
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Full Record 



Details for HUGENEFL:U15008_AT 
Fullscreen 

u . .„ . Cluster Members 
NetAffx Links consensus/Exemplar 

GeneChip Array Information 
Probe Set ID U 15008 at \ 



-> START v 



GeneChip 
Array 

Organism 
Common 
Name 



HumanGeneFL Array 



Human 



Probe Design Information 

Transcript ID U15008 
Se ?ype Ce Exemplar sequence 

Re pSS^ e U15008 



Target 
Description 



Assembly 
Alignments) 



Overlapping 
Transcripts 



Gene Title 

Gene Symbol 

Chromosomal 
Location 

UniGene ID 

Ensembl 

LocusLink 

SwissProt 

OMIM 



U15008. class A, 20 probes, 20 in U 15008 25-433, Human SnRNP core protein 
Sm D2 mRNA, complete cds 

Genomic Alignment of Target Sequence 
April 2003 (NCBI 33) 

Position % Identity Cytoband 

chM9: 50882580-50883664 (-) UGSC 98 q 13.32 



Representative 
Transcript 

NM_004597 
NCBI 

NM 1 77542 
■ NCBI 



UniGene Description 

small nuclear ribonucleoprotein D2 
polypeptide 16.5kDa 
small nuclear ribonucleoprotein D2 
polypeptide 16.5kDa 

Public Domain arid Genome References 
small nuclear ribonucleoprotein D2 polypeptide 16.5kDa 
SNRPD2 HGNC 

19q13.2 • ... 

Hs.424327 NJCBi (FULL LENGTH) 
ENSG00000125743 Ensembl , 
6633 NCBI \ 
P43330 EMBL-EBI \ 
601061 NCBI . 



Position 

chrl 9:50882558- 
50887282 (-) UCSC 

chrl 9:50882558- 
50887282 (-) UCSC 
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RefSeq Protein- NPJ)04588 NCBI 
ID 



RefSeq 



NP_808210 NCBi . . 
RefSeq Transcript ID RefSeq Title 

NM_004597 NCBI small nuclear ribonucleoprotein polypeptide D2 
NM_1 77542 NCBI small nuclear ribonucleoprotein polypeptide D2 

Functional Annotations 



ID ; 

ATHM 21 501:266482 AT 



Title Organism Type 

smallnuclear ribonucleo Arabidopsjs Putative 



Ortholog 





protein D2 -related 




Ortholog 


C. ELEGANS:172931 X AT 


small nuclear 


Celegans 


Putative 




ribonucleoprotein D2 




Ortholog 




like ... 






DROSGENOME1: 153483 AT 




Drosophila 


Putative 








Ortholog 


MG-U74AV2:95049 AT 


small nuclear 


Mouse 


Curated 




ribonucleoprotein D2 




Ortholog 


MOE430A: 1452680 AT 


small nuclear 


Mouse 


Curated 




ribonucleoprotein D2 




Ortholog 


MU1lkSUBA:AA271024 S AT 


small nuclear 


Mouse , 


Curated 




ribonucleoprotein D2 




Ortholog 


MOUSE430 2:1452680 AT 


small nuclear 


.. Mouse 


Curated 




ribonucleoprotein D2 




Ortholog 


MOUSE430A 2:1452680 AT 


smallnuclear 


Mouse . 


Curated 




ribonucleoprotein D2 




Ortholog 



GO Biological Process (view graph) 

ID Description 
245 spliceosome assembly 



6371 mRNA splicing 

GO Cellular Component (view graph) 
ID Description 
Gerie Ontology 5681 spliceosome complex 



Evidence 

traceable author 
statement 

traceable author 
statement 

Evidence 

traceable author 
statement 



5732 ' small nucleolar ribonucleoprotein inferred from electronic 
complex annotation 

30532 small nuclear ribonucleoprotein traceable author 
complex statement 

GO Molecular Function (view graph) 

ID Description Evidence 

8248 pre-mRNA splicing factor activity inferred from electronic 

annotation 



Links 

QuickGO 
AmiGO 

QuickGO 
AmiGO 



Links 

QuickGO 
AmiGO 

QuickGO 
AmiGO 

QuickGO 
AmiGO 



Links 

QuickGO 
AmiGO 



Protein 



Method 


ID 


Description 


E-Value 


blast 


4759158 


small nuclear ribonucleoprotein polypeptide D2; 
snRNP cpre protein D2 [Homo sapiens] 


1 ? 0E-62 


blast 


26337731 




3.0E-62 


blast \ 


4759158 


small nuclear ribonucleoprotein polypeptide D2; 
snRNP core protein D2 [Homo sapiens] 


1.0E-62 


blast • . 


26337731 




30E-62 


Database 


f ID 


Description 


E-Value 


scop 


d1b34b 


d1b34b_ SCOP:b.38.1 .1 :| D2 core SNRNP 


.1.85E- 
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Protein 
Domains 



protein ; 28 

scop d1b34b d1b34b_SCOP:b 38.1.1 :| D2 core SNRNP 1.85E- 

. protein ■ 28 . 

pfam LSM LSM domain 1.1E-16 

pfam LSM LSM domain 1.1E-16 

InterPro IPR001 163 Small nuclear ribonucleoprotein i (Sny protein) 
EMBL-EBI 



Target 
Sequence 



Sequence 

>HUGENEFL : Ul 5 0 0 8_AT 

accatcatgagcctcctcaacaagcccaagagtgagatgaccccagaggagctgcagaag 
cgagaggaggfaggaatttaacaccggtccactctctgtgctcacacagtcagtcaagaac 
aatacccaagtgctcatcaactgccgcaacaataagaaactcctgggccgcgtgaaggcc 
ttcgataggcactgcaacatggtgctggagaacgtgaaggagatgtggactgaggtaccc 
aagagtggcaagggcaagaagaagtccaagccagtcaacaaagaccgctacatctccaag 
atgttcctgcgcggggactcagtcatcgtggtcctgcggaacccgctcatcgccggcaag 
taggggccgcctgtctgttgacagaactcactcctctgtcctatgaagaccgctgccatt 
ggfcgttgagaata 



Probe Info 



Probe Probe 

Probe Sequence(5 , -3') x y 

ACCATCATGAGCCTCCTCAACAAGC 99 211 

AGTGAGATGACCCCAGAGGAGCTGC 100 211 

AACACCGGTCCACTCTCTGTGCtCA 101 21 1 

GGTCCACTCTCTGTGCTCACACAGT 102 211 

CTCTCtGTGCTCACACAGTCAGTCA 103 211 

GTGCTCACACAGTCAGTCAAGAACA 104 211 

TCAGTCAAGAACAATACCCAAGTGC 105 211 

AATACCCAAGTGCTCATCAACTGCC 106 21 1 

CAAGTGCTCATCAACTGCCGCAACA 107 211 

CGCGTGAAGGCCTTCGATAGGCACT 108 211 

AAGGCCTTCGATAGGCACtGCAACA 1 09 21 1 

TTCGATAGGCACTGCAACATGGTGC 110 211 

GTACGCAAGAGTGGCAAGGGCAAGA 1 11 211 

TACATCTCCAAGATGTTCCTGCGCG 112 211 

TCAGTCATCGTGGTCCTGCGGAACC 113 211 

TAGGGGCCGCGTGTCTGTTGACAGA 114 211 

TGACAGAACTCACTCCTCTGTCOTA 115 211 

CJCCTCTGTCCTATGAAGACCGGTG 116 211 

TGTCCTATGAAGACCGCTGCCATTG 117 211 

ACCGCTGGCATTGGTGTTGAGAATA 118 211 



Probe 
Interrogation 

Position 

37 

67 
115 
121 
127 
133 
145 
157 

163 . . 

205 

211 

217 

271 

325 

355 

397- 

415 

427 

433 

445. 



Strandedness 

Antisense 
Antisense 
Antisense 
Antisense 
Antisense 
Antisense 
Antisense 
Antisense 
Antisense 
Antisense 
Antisense 
Antisense 
Antisense 
Antisense 
Antisense 
Antisense 
Antisense 
Antisense 
Antisense 
Antisense 



888-DNA-CHIP (888-362-2447) \ +44 (0) J 628 1&2550 . feedback i e-mail support | terms of use J privacy policy 



htths://>^.affvmetrix 3/30/2004 



Affymetrix - Results 



Page ! of 2 



home *| logout [ register | your profile | contact | site index 



search site 



AFFYMETRIX. 

JOCK 



ANALYSE 



PRODUCTS _ 

GETTING 
STARTED 

. -> Wizard 



QUERY 
Expression 

.-> Quick Query 

-> Standard Query . 

-> Batch Query 

-> BLAST 

-> Probe Match 

-> UCSC Query 
Genotyping .' 
. Quick Query 

-> Standard Query 
.. -> Batch Query 

-> UCSC Query 

-> SNP Finder 

CURRENT QUERY 
1 probe sets 

-> Annotations . 
-> Show Orthologs 
->GO Browser 
-> Export . 

:: QUERY HISTORY 

Annotation Views 

-> Expression 
-> Genotyping 

-> BLAST Status 

-> New Folder 
-> Expression 
Queries 

•>(1)AH Descriptions 

(AFFX-BioB-M_st) 
->(1)AII Descriptions 

(HG613) 
->(2)All Descriptions 

(AFFX-BioDn-5) . 
->(2)AII Descriptions 

(AFFX-BioB-M) 
->(1)AII Descriptions 

(M12625j3t) . 
-> Genotyping 
Queries 



Announcing the New 

GeneChip* Rat Genome 230 2.0 Array 




SUPPORT TECHNOLOGY RESEARCH COMMUNITY i CORPORATE 



Full Record 



Details for HUGENEFL:AFFX-BIOB-M_ST 
FullScreen 

Cluster Members 
Consensus/Exemplar 

GeneChip Array Information 

AFFX-BioB-M st 



-> START ^ 



NetAffx Links 



Probe Set ID 
GeneChip 
Array 

Organism 
Common 
Name 



Transcript ID 
Sequence 
Type 



HumanGeneFL Array 



Human 



Probe Design Information 



AFFX-BioB-M 
Control sequence 



Representative J04423NCB| 
Public iu — . 



target 
Description 



Target 
Sequence 



J04423 E coli bioB gene biotin synthetase (-5, -M, -3 represent transcript regions 
5 prime, Middle, and 3 prime respectively) 

Sequence 

>HUGENEFL : AFFX-BIOB- M_ST 

gccggagttttacggcaatatcatcaccacacgcacttatcaggaacgcctcgatacgct 
ggaaaaagtgcgcgatgccgggatcaaagtctgttctggcggcattgtgggcttaggcga 
aacggtaaaagatcgcgccggattattgctgcaactggcaaacctgccgacgccgccgga 
aagcgtgccaatcaacatgctggtgaaggtgaaaggcacgccgcttgccgataacgat^a 
tgtcgatgcctttgattt 



Probe Sequence(5"-3') 



Probe Info 



201 
202 
203 
204 
205 



GATGATATTGCCGTAAAACTCCGGC 
TGTGGTGATGATATTGCCGTAAAAC 
TAAGTGCGTGTGGTGATGATATTGC 
GTTCCTGATAAGTGCGTGTGGTGAT 
ATCGAGGCGTTCCTGATAAGTG5GT 
GCATCGCGCACTTT1TCCA0CGTAT 206 
GATCCCGGCATCGCGCACTTTTTCC 207 
GACTTTGATCCCGGCATCGCGCACT 208 
CGCCAGAACAGACTTTGATCCCGGC 209 
CCCACAATGCCGCCAGAACAGACTT 210 



Probe 
Y 


Probe 

Interrogation strandedness 
Position 


11 


483 


Sense 


11 


489 . 


Sense 


11 


497 


Sense 


11 


505 . 


Sense 


11 


513 


Sense 


11 


536 


Sense 


11 


543 


Sense 


11 


549 


Sense 


11 


559. 


Sense 


11 


569 


Sense 
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TGCAGCAATAATCCGGCGCGATCTT 


211; 


11 . 


611 ; 


Sense 


TTGCGAGTTGCAGCAATAATCCGGQ : 


212 


11 


619 • 


..Sense 


CGGCAGGTTTGCCAGTTGCAGCAAT 


213 


11 


627 


Sense 


ATGTTGATTGGCACGCTTTCCC3GCG 


214 


11 


656 


Sense 


CACCAGCATGTTGATTGGCACGCTT 


215 


11 


663 


Sense 


: TTCACCTTCACCAGCATGTTGATTG 


216 


.11 


671 


Sensed 


AGGGGCGTGCCtTTCACCTTCACCA 


217 


11 


683 


Sense 


CATCATCGTTAf CGGCAAGGGGCGT 


218 


11 


- 700 


Sense 


GCATCGACATCATCGTTATCGGCAA 


219 


11 


707 


Sense 


AAATCAAAGGCATCGACATCATCGT 


220 : 


11 


716 


Sense 
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NetAffx Links 



Probe Set ID 

GeneChip 
Array 

Organism 
Common 
Name 



Transcript ID 
Sequence 
Type 



HumanGeneFL Array 



Human 



Probe Design Information 



AFFX-BioDn-5 
Control sequence 



Representative J0 _ NCBj 

Public ID ^44^JNIOWI 



Target 
Description 



Target 
Sequence 



J04423 E coli bioD gene dethiobiotin synthetase (-5 and -3 represent transcript 
regions 5 prime and 3 prime respectively) 

Sequence 

>HUGENEFL : AFFX-BIODN- 5_ST 

gggaaaactgtcgccagttgtgcacttttacaagccgGaaaggfcagcaggctaccggacg 
gcaggttataaaGcggtcgcctctggcagcgaaaagaccccggaaggftttacgcaatagc 
gacgcgctggcgt.tacagcgcaacagcagcctgcagctggattacgcaacagtaaatcct 
tacacct t cgcagaacccac t tcgccgcacat cat cagcgcgcaagagggcagaccgata 
gaatcattggtaatgagcgccggattacgcgcgc.ttg 



Probe Info 



Probe Sequence(5*-3') 


Probe 
X 


Probe 
Y 


Probe 
Interrogation 


Strandedness 






Position 




GTGCACAACTGGCGACAG 1 1 1 ICCC 


281 


11 


49 


Sense 


GGCTTGTAAAAGTGCACAACTGGCG 


282 


11 


60 


Sense 


GCTGCCTTTGCGGCTTGtAAAAGTG 


283 


11 


71 


Sense 


GGTAGCGTGCTGCCTTTGCGGCTTG 


284 . 


11 


79 


Sense 


CCGTCCGGTAGCCTGCTGCCTTTGC 


285 


U 


85 


Sense 


CAGCGCGTCGCTATTGCGTAAACCT 


286 


11 


153 


Sense v 


GTAACGCCAGCGCGJCGCTATTGCG 


287 


It 


160 


Sense 


TTdCGCTGTAACGCCAGCGCGTCGC 


288 


11 


167 


Sense 


TGCTGTTGCGCTGTAACGCCAGCGC 


289 


11 


172 


Sense 


TGCAGGCTGCTGTTGCGCTGTAACG 


290 


11 


179 


Sense 
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TGCAGCTGCAGGCTGCTGTTGCGCT 


291 


11 


185 


Sense 


TGCGTAATCCAGCTGCAGGCTGCTG 


.292. 


11 


192 


Sense 


TTACTGTTGCGTAATCCAGCTGCAG 


293 


11 


199 


:. Sense 


CGGTCTGCCCTCTTGCGCGCTGATG 


294 


it; 


261 


Sense 


GATTCTATCGGTCTGCCCTCTTGCG 


295 


11 


269 


Sense 


TAnCAATGATTCTATCGGTCTGCCC 


296 


.11- 


276 


Sense 


CTCATTAGCAATGATTCTATCGGTG 


297 


11 


281 


Sense 


TCCGGCGCTCATTACCAATGATTCT 


298 


11 


288 


Sense 


CGCGTAATCCGGCGCTCATTACCAA 


299 


11 


295 


Sense 


CAAGCGCGCQTAATCCGGCGCJCAT 


300 • 


11 


301 


Sense 
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NetAffx Links 



Probe Set ID 
GeneChip 
Array 

Organism 
Common 
Name 



HumanGeneFL Array 



Human 



Probe Design Information 



Transcript ID 

Sequence 
Type 
Representative 
Public ID 

Target 
Description 



Assembly 
Alignment(s) 



Overlapping 
Transcripts 



Gene Title 

Gene Symbol 

Chromosomal 
Location 

UniGene ID 

Ensembl 

LocusLink 

SwissProt 

OMIM. 



X15880 

Exemplar sequence 
X15880 NCBI 

X15880, class C, 20 probes, 20 in all_X15880 1690-2273, Human mRNA for 
collagen VI alpha-1 C-terminal globular domain 

Genomic Alignment of Target Sequence 

April 2003 (NCBI 33) 

Position % Identity Cytoband 

chr21 : 46280561-46281 145 (+) UCSC 100 q22.3 , 

Representative UniGene Description Position 
Transcript 

."■ NM_001848 collagen, type VI, alpha chr21:46257869-46281164 (+) 
NCBI 1 UCSC 

Public Domain and Genome References 
collagen, type VI, alpha 1 
COL6A1 HGNC 



21q22.3 

Hs!415997 NCBI (FULL LENGTH) 

ENSG0000Q142156 Ensennbl 

1291 NCBI 

P12109 EMBL-EBI 
Q7Z645 EMBL-EBI 
Q8TBN2 EMBL-EBI 
Q9BSA8 EMBL-EBI 

120220 NCBI 
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RefSeqProtein N pj,6 18 39jaCBL 



RefSeq 



Ortholog 



RefSeq Transcript ID RefSeq Title 

NMJ)01848 NCBi collagen, type VI, alpha 1 precursor 

Functional Annotations 

ID Title Organism Type 

MG-U74AV2:1 62459 F AT procollagen, type VI, Mouse Curated 
. alpha 1 Ortholog 

MG-U74AV2:95493 AT procollagen, type VI, Mouse .. Curated 

alpha 1 Ortholog 

MOE430A:1448590 AT procollagen, type VI, Mouse Curated 

alpha 1 Ortholog 

MU11KSUBB:X66405 S AT procollagen, type VI, Mouse Curated 

alpha 1 Ortholog 

MOUSE430 2:1448590 AT procollagen, type VI, Mouse Curated 
-. alpha 1 Ortholog 

MOUSE430A 2:1448590 AT procollagen, type VI, Mouse Curated 

alpha 1 : Ortholog 



Gene Ontology 



GO Biological Process (view graph) 

ID. Description Evidence 

7155 cell adhesion non-traceable author 

statement 

GO Cellular Component (view graph) 
ID Description Evidence 

5578 extracellular matrix • .. inferred from electronic 

annotation 
. non-traceable author 
statement 

GO Molecular Function (view graph) 

ID Description Evidence 

5194 cell adhesion "molecule activity inferred from electronic 

annotation 

5201 extracellular matrix structural inferred from electronic 
constituent ... annotation 



Protein 
Similarities 



Protein 
Domains 



5589 collagen type VI 



Links 

QuickGO 
AmiGO 



Links 

QuickGO 
AmiGO 

QuickGO 
AmiGO 

Links 

QuickGO 
AmiGO 

QuickGO 
AmiGO 



Method 


ID 


Description 


blast 


15011913 




biast 


13878903 




Database 


ID 


Description 


scop 


dlatza 


d1atza_ SCOP:c.62.1 von Willebrand 




A3 domain 


pfam 


vwa 


von Willebrand factor type A domain , 


pfam 


vwa < 


von Willebrand factor type A domain 


pfam 


vwa 


von Willebrand factor type A domain 


pfam 


Collaaeri 


Coliagen triple helix repeat (20 copies) 


pfam 


. Collaqen 


. Collagen triple hejix repeat (20 copies) 


pfam 


Cbllaaen 


Collagen triple helix repeat (20 copies) 


pfam 


Coliaaen 


Collagen triple helix repeat (20 copies) 


InterPro 


IPR008161 


Collagen helix repeat .. 




EMBL-EBI 




InterPro . 


IPR002035 von Willebrand factor, type A 




EMBL-EBI 





E-Value 

0.0 
0.0 

E-Value 

3.63E- 
37 

9.6E-24 
4.7E-32 
2.7E-35 
2.4E-11 
3.8E-14 
3.3E-10 
2.6E-11 
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JnterPro IPR008160 Collagen triple helix repeat 
EMBL-EBI . 



Sequence 

>HUGENEFL : XI 5 8 8 0_AT 

agcaagacgcctctcggggcctgtgcGgcactagcctccctctcctctgtccccatagct 
ggtttttcccaccaatcctcacctaacagttactttacaattaaactcaaagcaagctct 
tctcctcagcttggggcagccattggcctctgtctcgttttgggaaaccaaggtcaggag 
. gccgt tgcagacataaatctcggcgactcggccccgtctcctgagggtcctgctggtgac 
: Target cggcctggaccttggccctacagccctggagg.ccgctgctgaccagcactgaccccgacc 
Sequence . tcagagagtactcgcaggggcgctggctgcactcaagaccctcgagattaacggtgctaa 
■"; •ccccgtctgctcctccctcccgcagagactggggcctggactggacatgagagccccttg 
gtgccacagagggctgtgtcttactagaaacaacgcaaacctctccttcctcagaatagt 
. . gatgtgttcgacgttttatcaaaggccccctttctatgttcatgttagttttgctccttc 
tgtgtttttttctgaaccatatccatgttgctgactttt.ccaa 



Probe SequencelS'-a') 



Probe 
X 



Probe Info 



; AGCAAGACGCCTCTCGGGGCCTGTG 76 

aMCTCAAAGCAAGCTCTTCTCCTC 77 

AAAGCMGCTCTTGTCCTCAGCTTG 78 

tCTCCTCAGCTTGGGGCAGCCATTG 79 

GCCATTGGCCTCTGTCTCGTTTTGG 80 

GCAGACATAAATCTCGGCGACTCGG 81 . 

GCCdCGTCTCCTGAGGGTCCTGCTG 82 

TGGCCCTACAGCCCTGGAGGCCGCT 83 

TGAGAGAGTACTCGCAGGGGCGCTG 84 

AGTACTCGCAGGGGCGCTGGCTGCA 85 

GGCGCTGGCTGCACTCAAGACCCTC 86 

GGACATGAGAGCCCCTTGGTGCCAC 87 

GAGAGCCCCTTGGTGCCACAGAGGG 88 

CCCTTGGtGCCACAGAGGGCTGTGT 89 

GTGCCACAGAGGGCTGTGTCTTACT 90 

CAGAGGGCTGTGTCTTACTAGAAAC 91 

CTCCTTCCTCAGAATAGTGATGTGT 92 

TTTTTCTGAACCATATCCATGTTGC 93 

TGAACCATATCCATGTTGCTGACTT 94 

. ATATCCATGTTGCTGACTTTTCCAA 95 



Probe 
Y 

317 
317 
317 
317 
317 
317 
317 
317 
317 
317 
317 
317 
317 
317 
317 
317 
317 
317 
317 
317 



Probe 
Interrogation 

Position 

1702 

1804 

1810 

1822 

1840 

1888 

1912 

1954 

2002 

2008 . 

2020 

2104 

2110 

2116 

2122 

2128 

21.64 

2248 

2254 

2260 



Stranded nesi 

Antisense 
Antisense 
Antisense 
Antisense 
Antisense 
Antisense 
Antisense 
Antisense 
Antisense 
Antisense 
Antisense 
Antisense 
Antisense 
Antisense 
Antisense 
Antisense 
Antisense 
Antisense 
Antisense 
Antisense 
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search site 



AFFYMETRIX. 



PRODUCTS 



: ANALYSIS 



GETTING 
STARTED 

. -> Wizard 

:: QUERY 

Expression 
r> Quick Query ; 
.-> Standard Query 
-> Batch Query ' 
f -> BLAST ■ 
r> Probe Match 
-> UCSC Query 

Genotyping 
-> Quick Query 
s> Standard Query 
-> Batch Query 
->UCSC Query 
->SNP Finder 

CURRENT QUERY 
1 probe sets 

-^Annotations 
'« -> Show Orthologs 
. >>. GO Browser . 

r> Export 

:: QUERY HISTORY 

Annotation Views 

-> Expression.. 
-> Genotyping 

-> BLAST Status 

-> New Folder 
-> Expression 
Queries 

.->(1)AII Descriptions; 
. (U23752__at) 
->(1)AII Descriptions 
(HG1800- ■ 
HT1823_at) 
«>(1)AII Descriptions 
(U15008jat) 
. ->(1)AII Descriptions 
(HG3523) : 
->all probe sets 
(7129) 
r> Genotyping 
Queries 




•Jil 



SUPPOmr | TECHNOLOGY |" RESEARCH COMMUNITY .( CORPORATE 



Full Record 



Details for HUGENEFL:U23752_AT 
Full Screen 

Cluster Members 
Consensus/Exemplar 

GeneChip Array Information 

Probe Set ID U23752_at 

GeneChip H umanGeheFL Array 
Array 

Organism 



-> START ^ 



NetAffx Links 



Common 
Name 



Human 



Probe Design Information 

Transcript ID U23752 

Sequence templar sequence 
Type 

Re ^^ ve U23752^ 



Target 
Description 



Assembly 
Alignment(s) 



Overlapping 
Transcripts 



U23752, class A, 20 probes, 20 in U23752 1679-1919, Human SOX-11 mRNA, 
complete cds 

Genomic Alignment of Target Sequence 
April 2003 (NCBI 33) 

Position- % Identity Cytoband 

chr2: 5856192-5856457 (+) UCSC 99 p25.2 



Representative UniGene Description Position 

Transcript 

NM_003108 SRY (sex determining region Y)- chr2:5854537-5863255 (+) 

NCBI box 11 UCSC 



Public Domain and Genome References 
SRY (sex determining region Y)-box 11 



Gene Title 

Gene Symbol SOX11 HGNC 

Chromosomal 
Location ' 

UniGene ID 

Ensembl 

. LocusLink 

SwissProt 

OMIM 



2p25 

Hs.432638 NCBI (FULL LENGTH) 
ENSGOObO0 176887 Ensembl 
6664- jNCBI 

P35716 EMBL-EBI . 
600898 NCBI . 



RefSeq Protein Np ^ 003099 NCB | 
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RefSeq 



Ortholog 



Gene Ontology 



Protein 
Similarities 



Protein 
Domains 



RefSeq transcript ID RefSeq title 
NM_003108 NCBI SRY-box 11 

Functional Annotations 

ID Title 

RAE230A:1 387275 AT SRY-box containing gene 

■ - ■•• 11 • 

SRY-box containing gene 
11 



RG- 

U34A:AJ004858 AT 



Organism 
Rat 

Rat 



GO Biological Process (view graph) 

ID Description Evidence 

6355 regulation of transcription, DNA- inferred from electronic 
dependent : 

7399 neurogenesis 



annotation 

traceable author 
statement . 



GO Cellular Component (view graph) 

ID Description 
5634 nucleus 

GO Molecular Function (view graph) 

ID Description 
3677 DNA binding 



Method 
blast 



blast 



ID 

4507161 



Evidence 

inferred from electronic 
annotation 



Evidence 

inferred from electronic 
annotation, ' 

Description 



Type 

Putative 
Ortholog. 

Putative 
Ortholog, 



Links 

QuickGO 
AmiGO 

QuickGO 
AmiGQ : . 



Links 

QuickGO 
AmiGO 



Links 

QuickGO 
AmiGO 



SRY-box 11 ; SRY (sex-determining 
region Y)-box 11; SRY-related 
HMG-box gene 1 1 ; transcription 
factor SOX-1 1 [Homo sapiens] 



23831472 



Database ID 
scop d1i11a 



Description 
d1M1a SCOP:a;2i:i.i:| Sox-5; 



pfam HMG box HMG (high mobility group) box 
InterPro IPR000910 HMG1/2 (high mobility group) box 
EMBL-EBI 



E-Value 



0.0 



0.0 



E-Value 

2.36E- 
19 

1.1E-33 



Target 
Sequence 



Sequence 

>HUGENEFL : U2 3 7 5 2_AT 

cttcctttatcgtgtctcaaggtagttgcatacctagtctggagttgtgattattttccc 
aaaaaatgtgtttttgtaattactatttctttttcctgaaattcgtgattgcaacaaagg 
cagagggggcggcgcggcggaggggaggtaggacccgctccggaaggcgctgtttgaagc 
.ttgtcggtctttgaagtctggaagacgtctgcagaggacccttttggcagcacaactgtt 
actctagggagttggtggagatatt 



Probe SequenceitS'-S') ^ ; 


Probe 
X 


Probe 
Y 


Probe 
Interrogation 


Strandedness 






Position 




CTTCCTTTATCGTGTCTCAAGGTAG 


503 


219 


1691 


Antisense 


TTATCGTGTCTCMGGTAGTTGCAT 


504 


219 


1697 


Antisehse 


TCGTGTCTCAAGGTAGTTGCATACC 


505 


219 


1700 


Antisense 


AAGGTAGTTGCATACCTAGTCTGGA 


506 


219 


. 1709 


Antisense 


GTAGTTGCATACCTAGTCTGGAGTT 


.507 


219 


1712 


Antisense • 
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GTTGCATACCTAGTCT(3(3AGTTGTG 


508 


219 


1715 . 


Antisense ; 


TACCTAGTCTGGAGTTGTGATTATT 


509 


219 


1721 


Antisense 


CTAGTCTGGAGTTGTGATTAtTTTC 


510 


219 


1724 


Antisense 


TGTGATTAI 1 1 ICCCAAAAAATGTG 


511 


219 


. :1736 


Antisense 


TTTTCCTGAAATTCGTGATTGCAAC 


512 


219. 


1781 


Antisense 


GCTCCGGAAGGCGCTGTTTGAAGCT 


513 


219 


1847 


Antisense 


GCTGTTTGAAGCTTGTCGGTCTTTG 


514 


219 


1859 


Antisense 


TGAAGCTTGTCGGTCTTTGAAGTCT 


515 


219 


1865 


Antisense 


TTGTCGGTCTTTGAAGTCTGGAAGA 


i-51 6 


219 


1871 


Antisense 


TGGAAGACGTCTGCAGAGG AGCCTT 


517, 


219 


1889 


Antisense 


AAGACGTCTGCAGAGGACCCTTTTG 


518 


219 


1892 


Antisense 


gcagaggacccttttggcagcacAa 


51 9 


219 


.1901 


Antisense 


AGCACAACTGTTACTCTAGGGAGTT 


520 


219 


1919 


Antisense 


ACTGTTACTCTAGGGAGTTGGTGGA 


521 


219 


1925, 


Antisense 


AGTCTAGGG AGTTGGTGG AG ATATT 


522 


219 


1931 


Antisense 
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home | . log put. | register | your profile | contact | site index. 



search site 



AFFYMETRIX, 



PRODUCTS 



::: GETTING 
STARTED 

-> Wizard 

« QUERY 
Expression 

-> Quick Query. 

-> Standard Query 

-> Batch Query. 

-> BLAST 

-> Probe Match 

-> UCSC Query 
Genotyping 

-> Quick Query 

;-> Standard Query 
. -> Batch Query 

-> UCSC Query 

->SNP Finder 

CURRENT QUElRY 
1 probe sets 

-> Annotations 
-> Show Orthologs 
-> GO Browser 
-> Export 

:: QUERY HISTORY 

Annotation Views 
-> Expression 
-> Genotyping 

-> BLAST Status 

-> New Folder 
^Expression 
Queries 

->(1)AII Descriptions 

(M12625_at).. 
->(l)Aii Descriptions 

(U23752_at) 
->{1)AII Descriptions 
. (HG1800- 

HT1823_at) 
->(1)AII Descriptions 

(U15008_at) 
->(1)AII Descriptions 

(HG3523) 
-> Genotyping 
Queries 




SUPPORT | TECHNOLOGY I RESEARCH COMMUNITY j CORPORATE . 

Full Record 



Details for HUGENEFL:M12625_AT 

Fullscreen';-. 

Cluster Members 
Consensus/Exemplar 

GeneChip Array Information 

M12625 at : ? 



-> START ^ 



NetAffx Links 



Probe Set ID 
GeneChip 
Array 

Organism 
Common 
Name 



Transcript ID 
Sequence 
Type 



HumanGeneFL Array 



Human 



Probe Design Information 



M12625 

Exemplar sequence 



^Public ID* M1262.5 {j£Bl 



Target 
Description 



Assembly 
Alignment(s) 



Overlapping 
Transcripts 



Gene Title 
Gene Symbol 
Chromosomal 

Location 

UniGene ID 
Ensembl 
LocusLihk 

SwissProt 

EC 
OMIM 



M1 2625, class. B, 20 probes, 13 in M12625mRNA 893-1259: 7 in 
reverseSequence, 1599-1683, Human lecithin-cholesterol acyltransferase mRNA, 
complete cds, with 5' and 3' flanking DNA sequences 

Genomic Alignment of Target Sequence 
April 2003 (NCBI 33) ■ 

Position % Identity Cytoband 

Chr16: 67749925-67750484 (-) UCSC 100 q22.1 



M12625 NCBI 



Representative UniGe ne Description 
Transcript 

lecithin-cholesterol 
acyltransferase 

Public Domain and Genome References 
lecithin-cholesterol acyltransferase 
LCAT HGNC 

16.q22.1- 

Hs!387239 NCBI (FULL LENGTH) . \ 

ENSG00000.1 03080 Eosernb! 

3931 NCBI ' 

AAP88750 EMBL-EBI 
P04180 EMBL-EBI 

2.3AA3 

606967 NCBI . 



Position 

chrl 6:67749888-67754507 (-) 
UCSC 
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RefSe( j D Protein NPJ60220 NCB1 



RefSeq 



Ortholog 



Protein 
Similarities 



Protein 
Families 



RefSeq Transcript ID RefSeq Title 

NM_000229 NCBI leqthin-cholesterol acyltransferase precursor 

Functional Annotations 

Title 

lecithin cholesterol 
acyltransferase 

lecithin cholesterol 
acyltransferase 

lecithin cholesterol 
acyltransferase ; 

MU11KSUBA:J05154 S AT lecithin cholesterol 

acyltransferase 

lecithin cholesterol 
acyltransferase 

lecithin cholesterol 
acyltransferase 

lecithin cholesterol 
acyltransferase 

MOUSE43QA 2:1417043 AT lecithin cholesterol 

acyltransferase 



ID 

MG-U74AV2:103Q23 AT \ 
MG-U74AV2:161759 R AT 
MOE430A: 141 7043 AT 



RAE230A:1367887 AT 
RG-U34A:X54096 AT . 
MOUSE430 2:1417043 AT 



Gene Ontology 



Organism 
Mouse 

Mouse 

Mouse 

Mouse \ 

Rat 

Rat 

Mouse 

Mouse 



GO Biological Process (view graph) 
ID Description 

6629 lipid metabolism 

GO Cellular Component (view graph) 
ID Description 

5576 extracellular. 

GO Molecular Function (view graph) 

ID Description 

4607 phosphatidylcholine-sterol O-^ 
acyltransferase activity 

8415 acyltransferase activity 
1 6740 transferase activity 



Evidence 

inferred from 
electronic annotation 

Evidence 
not recorded 



Evidence 

inferred from 
electronic annotation 

inferred from 
electronic annotation 

inferred from 
electronic annotation 



Methoci 

blast, 
blast 

Method 



Description 



Type 

Gurated 
Ortholog 

Curated 
Ortholog 

Curated 
Ortholog 

Curated 
Ortholog 

Curated 
Ortholog 

Curated 
Ortholog 

Curated 
Ortholog 

Curated 
Ortholog 



Links 

QuickGO 
AmiGO 



Links 

QuickGO 
AmiGO 



Links 

QuickGO 
AmiGO 

QuickGO 
AmiGO 

QuickGO 
AmiGO 



ID 

32879837 

4557892 lecithin-cholesterol acyltransferase precursor 



[Homo sapiens] 



ID 



Description 



ec LCAT HUMAN LCAT_HUMAN 

EC:2.3.1'43:PHOSPHATIDYLCHOLINE- . 
STEROL ACYLTRANSFERASE PRECURSOR 
• '. (EC 2.3.1 .43) (LECITHIN-CHOLESTEROL 
ACYLTRANSFERASE) (PHOSPHOLIPID- 
CHOLESTEROL ACYLTRANSFERASE). 



E-Value 

0.0 

0.0 

E- 
Value 

1.85E- 
171 



Database 



ID 



Description 



E-Value 
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Protein 
Domains 



scop d1tca__ d1tca_ SCOP:c.69.1 :17:| Triacylglycerol lipase 5.3E-8 

pfam * LACT Lecithin:choiesterol acyltransferase 1.7E- 
• '." ' ' ..'182 ; 

InterPro IPR003386 Lecithinichplesteroi acyltransferase 
EMBL-EBI 

InterPro IPR008262 Lipase, active site 

EMBL-EBI . 



Target 
Sequence 



Trans Membrane 
ID 

NP 000220 . 2 



Number Of probability of Interior N-Terminus 
Domains : 

0.05945 



Sequence 

>HUGENEFL : Ml 2 6 2 5_AT . 

cttcaactacacaggccgtgacttccaacgcttctttgcagacctgcactttgaggaagg 
ctggtacatgtggctgcagtcacgtgacctcctggcaggactcccagcacctggtgtgga 
agtatactgtctttacggcgtgggcctgcccacgccecgcacctacatctacgaccacgg 
cttcGCctacacggaccctgtgggtgtgctctatgaggatggtgatgacacggtggcgac 
ccgcagcaccgagctctgtggcctgtggcagggccgccagccacagcctgtgcacctgfct 
gcccctgcacgggatacagcatctcaacatggtcttcagcaacctgaccctggagcacat 
caatgccatcctgctgggtgcctaccgccagggtccccctgcatccccgactgccagccc 
agagcccccgcctcctgaataaagaccttcctttgctaccgtaagccctgatggctatgt 
ttcaggttgaagggaggcactagagtcccacactaggtttcactGctcaccagccacagg 
ctGagtgctgtgtgcagtg - 



Probe SequencefS'-a*) 



Probe 
X 



Probe Info 



CTTCAACTACACAGGCCGTGACTTC 152 

CTACACAGGCCGTGACTTCCAACGC 153 

CCAACGCTTCTTTGCAGACCTGGAC 1 54 

CCTGCACTTTGAGGAAGGCTGGTAC .' 1 55 

CATGTGGCTGCAGTCACGTGACCTC 1 56 

GCTGCAGTCACGTGAGCTCCTGGCA 1 57 

CCTGGCAGGACTCCCAGCAGCTGGT 158 

GGACCCTGTGGGTGTGCTQTATGAG 159 

TGTGCTCTATGAGGATGGTGATGAC 160 

GGCGACCCGCAGCACCGAGCTQTGT 161 

CCTGACCCTGGAGCACATCAATGCC 162 

GCACATCAATGCCATCCTGCTGGGT |63 

CATCCTGCTGGGTGCCTACCGCCAG 164 

CTTTGCTACCGTAAGCCCTGATGGC 1 65 

TACCGTAAGCCCTGATGGCTATGTT 166 

AAGCCCTGATGGGTATGTTTCAGGT 1 67 

CTATGTTTCAGGTTGAAGGGAGGCA 168 

GGAGGCACTAGAGTCCCACACTAGG 169 

GTCCCACACTAGGTTTCACTCCTCA 170 

CACAGGCTCAGTGCTGTGTGCAGTG 171 



Probe 

•Y' 

127 

127 

127 

127 

127 

127 

127 . 

127 

127 

127 

127 

127 

.127 

127 

127 

127 

127 

127 

127. 

.127 



Probe 
Interrogation 

Position 

1161 

1167 

1185 

1203 

1227 

1233 

1251 

1353 

1365 

1395 

1503 

1515 

1527 

1611 

1617 - 

1623 

1635 

1653 

1665 
.1695 



Strandednesj 

Aritisense 

Antisense 

Antisense 

Antjsehse 

Antisense 

Antisense 

Antisense 

Antisense 

Antisense 

Antisense 

Antisense ; 

Antisense 

Antisense 

Antisense 

Antisense 

Antisense 

Aritisense 

Antisense . 

Antisense 

Antisense 
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